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Abstract 

We consider the mobile facility location (MFL) problem. We are given a set of facilities and clients 
located in a common metric space G = {V, c). The goal is to move each facility from its initial location 
to a destination (in V) and assign each client to the destination of some facility so as to minimize the sum 
of the movement-costs of the facilities and the client-assignment costs. This abstracts facility-location 
settings where one has the flexibility of moving facilities from their current locations to other destinations 
so as to serve clients more efficiently by reducing their assignment costs. 

We give the first local-search based approximation algorithm for this problem and achieve the best- 
known approximation guarantee. Our main result is (3 + e) -approximation for this problem for any 
constant e > using local search. The previous best guarantee for MFL was an 8-approximation al- 
gorithm due to 1 13 1 based on LP-rounding. Our guarantee matches the best-known approximation 
guarantee for the fc-median problem. Since there is an approximation-preserving reduction from the 
fc-median problem to MFL, any improvement of our result would imply an analogous improvement for 
the fc-median problem. Furthermore, our analysis is tight (up to o(l) factors) since the tight example for 
the local-search based 3 -approximation algorithm for fc-median can be easily adapted to show that our 
local-search algorithm has a tight approximation ratio of 3. One of the chief novelties of the analysis 
is that in order to generate a suitable collection of local-search moves whose resulting inequalities yield 
the desired bound on the cost of a local-optimum, we define a tree-like structure that (loosely speak- 
ing) functions as a "recursion tree", using which we spawn off local-search moves by exploring this 
tree to a constant depth. Our results extend to the weighted generalization wherein each facility i has a 
non-negative weight Wi and the movement cost for i is Wi times the distance traveled by i. 

1 Introduction 

Facility location problems have been widely studied in the Operations Research and Computer Science 
communities (see, e.g., 125] and the survey [20|), and have a wide range of applications. In its simplest 
version, uncapacitated facility location (UFL), we are given a set of facilities or service-providers with 
opening costs, and a set of clients that require service, and we want to open some facilities and assign clients 
to open facilities so as to minimize the sum of the facility-opening and client-assignment costs. An oft-cited 
prototypical example is that of a company wanting to decide where to locate its warehouses/distribution 
centers so as to serve its customers in a cost-effective manner. 

We consider facility-location problems that abstract settings where facilities are mobile and may be 
relocated to destinations near the clients in order to serve them more efficiently by reducing the client- 
assignment costs. More precisely, we consider the mobile facility location (MFL) problem introduced by ifTTl 

*A preliminary version (2), without the results in Section [6] appeared in the Proceedings of the 24th Annual ACM-SIAM 
Symposium on Discrete Algorithms, 2013. 
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[T3l . which generalizes the classical /c-median problem (see below). We are given a complete graph G = 
{V,Eg) with costs {c{u,v)} on the edges, a set D C V of clients with each client j having dj units of 
demand, and a set C V of k initial facility locations. We use the term facility i to denote the facility 
whose initial location is i £ T. A solution 5 to MFL moves each facility i to a final location Si ^ V (which 
could be the same as i), incurring a movement cost c{i, Si), and assigns each client j to a final location 
s £ S, incurring assignment cost djc{j, s). The total cost of S is the sum of all the movement costs and 
assignment costs. More formally, noting that each client will be assigned to the location nearest to it in S, 
we can express the cost of S as 

MFL(5) := <h Si) + d,c(i, 

where a{v) (for any node v) gives the location in S nearest to v (breaking ties arbitrarily). We assume 
throughout that the edge costs form a metric. We use the terms nodes and locations interchangeably. 

Mobile facility location falls into the genre of movement problems introduced by Demaine et al. ifTTTl . In 
these problems, we are given an initial configuration in a weighted graph specified by placing "pebbles" on 
the nodes and/or edges; the goal is to move the pebbles so as to obtain a desired final configuration while min- 
imizing the maximum, or total, pebble movement. MFL was introduced by Demaine et al. as the movement 
problem where facility- and client- pebbles are placed respectively at the initial locations of the facilities and 
clients, and in the final configuration every client-pebble should be co-located with some facility -pebble. 

Our results. We give the first local-search based approximation algorithm for this problem and achieve 
the best-known approximation guarantee. Our main result is a (3 + e) -approximation for this problem 
for any constant e > using a simple local-search algorithm. This improves upon the previous best 8- 
approximation guarantee for MFL due to Friggstad and Salavatipour [tl3il . which is based on LP-rounding 
and is not combinatorial. 

The local-search algorithm we consider is quite natural and simple. Observe that given the final locations 
of the facilities, we can find the minimum-cost way of moving facilities from their initial locations to the final 
locations by solving a minimum-cost perfect-matching problem (and the client assignments are determined 
by the function a defined above). Thus, we concentrate on determining a good set of final locations. In 
our local-search algorithm, at each step, we are allowed to swap in and swap out a fixed number (say p) 
of locations. Clearly, for any fixed p, we can find the best local move efficiently (since the cost of a set 
of final locations can be computed in polytime). Note that we do not impose any constraints on how the 
matching between the initial and final locations may change due to a local move, and a local move might 
entail moving all facilities. It is important to allow this flexibility, as it is known |[T3l that the local-search 
procedure that moves, at each step, a constant number of facilities to chosen destinations has an unbounded 
approximation ratio. 

Our main contribution is a tight analysis of this local-search algorithm (Section [4]). Our guarantee 
matches (up to o(l) terms) the best-known approximation guarantee for the /c-median problem. Since there 
is an approximation-preserving reduction from the A;-median problem to MFL llT3l — choose arbitrary initial 
facility locations and give each client a huge demand D — any improvement of our result would imply an 
analogous improvement for the A;-median problem. (In this respect, our result is a noteworthy exception to 
the prevalent state of affairs for various other generalizations of UFL and fe-median — e.g., the data place- 
ment problem f4l, {matroid-, red-blue-} median |[22l[T6l l9ll6]|. fe-facility-location llT2l[T5l — where the best 
approximation ratio for the problem is worse by a noticeable factor (compared to UFL or A;-median); [ 14| is 
another exception.) Furthermore, our analysis is tight (up to o(l) factors) because by suitably setting D in 
the reduction of |[T3]| . we can ensure that our local-search algorithm for MFL coincides with the local-search 
algorithm for /c-median in ||3l which has a tight approximation ratio of 3. 
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We also consider a weighted generalization of the problem (Section [5]l, wherein each facility i has a 
weight Wi indicating the cost incurred per-unit distance moved and the cost for moving i to Si is Wic{i, Si). 
(This can be used to model, for example, the setting where different facilities move at different speeds.) Our 
analysis is versatile and extends to this weighted generalization to yield the same performance guarantee. For 
the further generalization of the problem, where the facility-movement costs may be arbitrary and unrelated 
to the client-assignment costs (for which a 9-approximation can be obtained via LP-rounding; see "Related 
work"), we show that local search based on multiple swaps has a bad approximation ratio (SectionjT]). 

The analysis leading to the approximation ratio of 3 (as also the simpler analysis in Section |3]yielding a 
5-approximation) crucially exploits the fact that we may swap multiple locations in a local-search move. It 
is natural to wonder then if one can prove any performance guarantees for the local-search algorithm where 
we may only swap in and swap out a single location in a local move. (Naturally, the single-swap algorithm 
is easier to implement and thus may be more practical). In Section[6j we analyze this single-swap algorithm 
and prove that it also has a constant approximation ratio. 

Our techniques. The analysis of our local-search procedure requires various novel ideas. As is common 
in the analysis of local-search algorithms, we identify a set of test swaps and use local optimality to generate 
suitable inequalities from these test swaps, which when combined yield the stated performance guarantee. 
One of the difficulties involved in adapting standard local-search ideas to MFL is the following artifact: in 
MFL, the cost of "opening" a set S of locations is the cost of the min-cost perfect matching of to S, which, 
unlike other facility-location problems, is a highly non-additive function of S (and as mentioned above, we 
need to allow for the matching from to S" to change in non-local ways). In most facility-location problems 
with opening costs for which local search is known to work, we may always swap in a facility used by the 
global optimum (by possibly swapping out another facility) and easily bound the resulting change in facility 
cost, and the main consideration is to decide how to reassign clients following the swap in a cost-effective 
way; in MFL we do not have this flexibility and need to carefully choose how to swap facilities so as to 
ensure that there is a good matching of the facilities to their new destinations after a swap and there is a 
frugal reassignment of clients. 

This leads us to consider long relocation paths to re-match facilities to their new destinations after a 
swap, which are of the form (. . . , Sj, Oj, Sj/, . . .), where Sj and Oj are the locations that facility i is moved to 
in the local and global optimum, S and O, respectively, and Sj/ is the 5-location closest to Oj. By considering 
a swap move involving the start and end locations of such a path Z, we can obtain a bound on the movement 
cost of all facilities i E Z where Si is the start of the path or Oj serves a large number of clients. To account 
for the remaining facilities, we break up Z into suitable intervals, each containing a constant number of 
unaccounted locations which then participate in a multi-location swap. This interval-swap move does not at 
first appear to be useful since we can only bound the cost-change due to this move in terms of a significant 
multiple of (a portion of) the cost of the local optimum! One of the novelties of our analysis is to show 
how we can amortize the cost of such expensive terms and make their contribution negligible by considering 
multiple different ways of covering Z with intervals and averaging the inequalities obtained for these interval 
swaps. These ideas lead to the proof of an approximation ratio of 5 for the local-search algorithm (Section|3]l. 

The tighter analysis leading to the 3-approximation guarantee (Section |4]l features another noteworthy 
idea, namely that of using "recursion" (up to bounded depth) to identify a suitable collection of test swaps. 
We consider the tree-like structure created by the paths used in the 5-approximation analysis, and (loosely 
speaking) view this as a recursion tree, using which we spawn off interval-swap moves by exploring this 
tree to a constant depth. To our knowledge, we do not know of any analysis of a local-search algorithm 
that employs the idea of recursion to generate the set of test local moves (used to generate the inequalities 
that yield the desired performance guarantee). We believe that this technique is a notable contribution to the 
analysis of local-search algorithms that is of independent interest and will find further application. 
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Related work. As mentioned earlier, MFL was introduced by Demaine et al. fTT\ in the context of move- 
ment problems. Friggstad and Salavatipour |FT3| designed the first approximation algorithm for MFL. They 
gave an 8-approximation algorithm based on LP rounding by building upon the LP-rounding algorithm of 
Charikar et al. ||8l for the fc-median problem; this algorithm works only however for the unweighted case. 
They also observed that there is an approximation-preserving reduction from A:-median to MFL. We recently 
learned that Halper [17] proposed the same local-search algorithm that we analyze. His work focuses on 
experimental results and leaves open the question of obtaining theoretical guarantees about the performance 
of local search. 

Chakrabarty and Swamy (6] observed that MFL, even with arbitrary movement costs is a special case of 
the matroid median problem [|22.I . Thus, the approximation algorithms devised for matroid median indepen- 
dently by im and 161 yield an 8-approximation algorithm for MFL with arbitrary movement costs. 

There is a wealth of literature on approximation algorithms for (metric) uncapacitated and capacitated 
facility location (UFL and CFL), the fc-median problem, and their variants; see f27] for a survey on UFL. 
Whereas constant- factor approximation algorithms for UFL and A;-median can be obtained via a variety of 
techniques such as LP-rounding ||28l |23l [H H), primal-dual methods |Il8l|I3, local search ETllTllBll. all 
known 0(l)-approximation algorithms for CFL (in its full generality) are based on local search ||2T1 [301 151. 
We now briefly survey the work on local-search algorithms for facility-location problems. 

Starting with the work of [21 1, local-search techniques have been utilized to devise 0(l)-approximation 
algorithms for various facility-location problems. Korupolu, Plaxton, and Rajaraman |[2T1 devised 0(1)- 
approximation for UFL, and CFL with uniform capacities, and fc-median (with a blow-up in k). Charikar 
and Guha [7 j, and Arya et al. [3| both obtained a (1 + -v/2) -approximation for UFL. The first constant-factor 
approximation for CFL was obtained by Pal, Tardos, and Wexler 1,26 J . and after some improvements, the 
current-best approximation ratio now stands at 5 + e [|5l. For the special case of uniform capacities, the 
analysis in ||2T1 was refined by lITOl . and Aggarwal et al. [[H obtain the current-best 3-approximation. Arya 
et al. f3] devised a (3 + e) -approximation algorithm for A:-median, which was also the first constant-factor 
approximation algorithm for this problem based on local search. Gupta and Tangwongsan 1,151 (among other 
results) simplified the analysis in [3|. We build upon some of their ideas in our analysis. 

Local-search algorithms with constant approximation ratios have also been devised for various variants 
of the above three canonical problems. Mahdian and Pal [24], and Svitkina and Tardos [29] consider settings 
where the opening cost of a facility is a function of the set of clients served by it. In [24], this cost is a non- 
decreasing function of the number of clients, and in ||29l this cost arises from a certain tree defined on the 
client set. Devanur et al. llT2l and lITSl consider A;-facility location, which is similar to fc-median except 
that facilities also have opening costs. Hajiaghayi et al. [ 16] consider a special case of the matroid median 
problem that they call the red-blue median problem. Most recently, [14] considered a problem that they call 
the fc-median forest problem, which generahzes fc-median, and obtained a (3 + e) -approximation algorithm. 

2 The local-search algorithm 

As mentioned earlier, to compute a solution to MFL, we only need to determine the set of final locations of 
the facilities, since we can then efficiently compute the best movement of facilities from their initial to final 
locations, and the client assignments. This motivates the following local-search operation. Given a current 
set S of k = \T\ locations, we can move to any other set S' of k locations such that \ S"| = |S" \ 5| < p, 
where p is some fixed value. We denote this move by swap(S' \ S' , S' \ S). The local-search algorithm 
starts with an arbitrary set of k final locations. At each iteration, we choose the local-search move that 
yields the largest reduction in total cost and update our final-location set accordingly; if no cost-improving 
move exists, then we terminate. (To obtain polynomial running time, as is standard, we modify the above 
procedure so that we choose a local-search move only if the cost-reduction is at least e(current cost).) 



4 



3 Analysis leading to a 5-approximation 



We now analyze the above local-search algorithm and show that it is a (5 + o(l)) -approximation algorithm. 
For notational simplicity, we assume that the local-search algorithm terminates at a local optimum; the 
modification to ensure polynomial running time degrades the approximation by at most a (1 + e) -factor (see 



also Remark 3.8 1. 



Theorem 3.1 Let F* and C* denote respectively the movement and assignment cost of an optimal solution. 
The total cost of any local optimum using at most p swaps is at most [2) + 0[^;:^^'^F* + ('5 + 0(^^^3)^(7*. 



Although this is not the tightest guarantee that we obtain, we present this analysis first since it introduces 
many of the ideas that we build upon in Section |4]to prove a tight approximation guarantee of (3 + o(l)) 
for the local-search algorithm. For notational simplicity, we assume that all djS are 1. All our analyses carry 
over trivially to the case of non-unit (integer) demands since we can think of a client j having dj demand as 



dj co-located unit-demand clients. 



Notation and preliminaries. We use S" = {si, . . . , s^} to denote the local optimum, where facility i is 
moved to final location Sj G S. We use O = {oi, . . . , Ok} to denote the (globally) optimal solution, where 
again facility i is moved to Oj. Throughout, we use s to index locations in 5, and o to index locations in O. 
Recall that, for a node v, a{v) is the location in S nearest to v. Similarly, we define a* (v) to be the location 
in O nearest to v. For notational similarity with facility location problems, we denote c{i, Sj) by /j, and 
c(i, Oi) by /*. (Thus, fi and /* are the movement costs of i in S* and O respectively.) Also, we abbreviate 
c(j, a{j)) to Cj, and c(j, cr*{j)) to c*. Thus, cj and c* are the assignment costs of j in the local and global 
optimum respectively. (So MFL(5) = J2ieJ^ + Sjec ^i^) = {j ^ ^ • ^(j) = ■s} be the set 

of clients assigned to the location s ^ S, and D*{o) = {j £ V : a*{j) = 0}. For a set A C S, we define 
D{A) = UseA ^i^)' we define D*{A) for A C O similarly. Define cap(s) = {o £ O : a{o) = s}. We say 
that s captures all the locations in cap(s). The following basic lemma will be used repeatedly. 

Lemma 3.2 For any client j, we have c[j, a{a*{j))) — c(^j, (T(j)) ^ 2c*. 

Proof : Let s = (7{j), o = (7*{j), s' = a{o). The lemma clearly holds if s' = s. Otherwise, c(j, s') — 
c( j, s) < c{j, 0) + c(o, s') — c{j, s) < c* + c(o, s) — c{j, s) < c* + c{o, j) = 2c* where the second inequahty 
follows since s' is the closest location to o in 5*. ■ 

To prove the approximation ratio, we will specify a set of local-search moves for the local optimum, 
and use the fact that none of these moves improve the cost to obtain some inequalities, which will together 
yield a bound on the cost of the local optimum. We describe these moves by using the following digraph. 
Consider the digraph G = (.FU 5 U O, {(si, i), (i, Oi), (oj, cj(oi))}jgj-). We decompose G into a collection 
of node-disjoint (simple) paths V and cycles C as follows. Repeatedly, while there is a cycle C in our current 
digraph, we add G to C, remove all the nodes of G and recurse on the remaining digraph. After this step, 
a node v in the remaining digraph, which is acyclic, has: exactly one outgoing arc if v £ S; exactly one 
incoming and one outgoing arc if v £ F; and exactly one incoming, and at most one outgoing arc if v G O. 
Now we repeatedly choose a node v £ S with no incoming arcs, include the maximal path P starting at v in 
V, remove all nodes of P and recurse on the remaining digraph. Thus, each triple (sj, i, Oi) is on a unique 
path or cycle inVuC. Define center(s) to be o G O such that (o, s) is an arc in P U C; if s has no incoming 
arc inV UC, then let center(s) = nil. 

We will use V and C to define our swaps. For a path P = {si^, ii, Oi^, . . . , Sj,. , Oi^) £ V, define 
start(P)tobesjj and end (P) to be Oj^. Notice that cj(ojJ ^ P. Foreachs G S,\etVc{s) = {P : end(P) G 
cap(s)}, T{s) = {start(P) : P £ Vc{s)], and H{s) = {end(P) : P G Vc{s)] = cap(s) \ center(s). Note 
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that \Vc{s)\ = \T{s)\ = \H{s)\ = |cap(s)| - 1 for any s G 5" with |cap(s)| > 1. For a set ^ C S, define 
nA) = U.eA^ls), H{A) = [j^^^His], VM) = U.eA^c(s). 

A basic building block in our analysis, involves a shift along an s o = Oj' sub-path Z of some path or 
cycle inV U C. This means that we swap out s and swap in o. We bound the cost of the matching between 
T and 5 U {o} \ {s} by moving each initial location i £ Z, i ^ i' to a{oi) E Z and moving i' to o,/. Thus, 
we obtain the following simple bound on the increase in movement cost due to this operation: 

shl{t{s,o) = Y,if:-n)+ E c(o„a(o,)) <2j;/;-c(o,a(o)). (1) 

The last inequality uses the fact that c(oj, (T(oj)) < c(oj, Sj) < /* + fi for all i. For a path P G "P, we use 
shift(P) as a shorthand for shift(start(P), end(P)). 



3.1 The swaps used, and their analysis 

We now describe the local moves used in the analysis. We define a set of swaps such that each o G O is 
swapped in to an extent of at least one, and at most two. We classify each location in S as one of three types. 
Define t = [p^/^\ . We assume that t > 2. 

• ^o: locations s e S with |cap(s)| = 0. 

• Si. locations s e S\So with |D*(center(s))| < t or |cap(s)| > t. 

• locations s e S with |i:'*(center(s))| > t and < |cap(s)| < t. 

Also define S3 := U {s G S*! : |cap(s)| < t} (so s £ S3 iff |cap(s)| < t and |P>*(center(s))| < t}). 

To gain some intuition, notice that it is easy to generate a suitable inequality for a location s £ Sq: 
we can "delete" s (i.e., if s = Si, then do swap(s,i)) and reassign each j G D{s) to a{a*{j)) (i.e., the 
location in S closest to the location serving j in O). The cost increase due to this reassignment is at most 
Ylj&D{s) ^'i^ '^his yields the inequality fi < J2j(^D{s) ^cj- (We do not actually do this since we 
take care of the 5o-locations along with the S*! -locations.) We can also generate a suitable inequality for 



a location s G 52 (see Lemma 3.4| since we can swap in cap(s) and swap out {s} U T{s). The cost 
increase by this move can be bounded by J^Pf^vds) shift(P) and c(s, center(s)), and the latter quantity 
can be charged to j XyjG-D*(center(s)) 

{cj + c*); our definition of ^2 is tailored precisely so as to enable this 
latter charging argument. Generating inequalities for the 5i -locations is more involved, and requires another 
building block that we call an interval swap (this will also take care of the 5o-locations), which we define 



after proving Lemma 3.4 We start out by proving a simple bound that one can obtain using a cycle in C. 



Lemma 3.3 For any cycle Z £ C, we have < J2iez{~fi + ft c(oj, (T(oj))). 



Proof : Consider the following matching of n Z to 5 n Z: we match i to a{oi). The cost of the resulting 
new matching is J2i(^z /« + J2iez ^('''^^('^i)) which should at least J2i fi since the latter is the min-cost 
way of matching J" to 5. So we obtain < I]iez(-/i + o-(oi))) < Y^i(zz{-fi + f* + c{oi,a{oi))). ■ 



Lemma 3.4 Let s £ S2 and o = center(s), and consider swap(X := {s} U T(s), Y := cap(s)). We have 
0<MFL((5\X)uy)-MFL(5)< Yl 2/*+ Yl E ^c*. (2) 



6 



Proof : We can view this multi-location swap as doing swap(start(P), end(P)) for each P G Vds) and 
swap(s, o) simultaneously. (Notice that no path P € 'Pcl-s) contains s, since s = cr(end(P)) ^ P.) For each 
swap(start(P), end(P)) the movement-cost increase is bounded by shift(P) < YlieP "^ft • For swap(s, o) 
we move the facility i, where s = Sj, to o, so the increase in movement cost is at most c(s, o) = c{a{o) , o) < 

c{a{j),o) < Cj + c* for every j G D*{o). So since |Z)*(o)| > t, we have c(s,o) < Y2jeD* (o) t ' Thus, 
the increase in total movement cost is at most 

We upper bound the change in assignment cost by reassigning the clients in D*{o) U D{X) as follows. 
We reassign each j e D*{o) to o. Each j e D{X) \ D*{o) is assigned to cr*{j), if cr*{j) e Y, and 
otherwise to s' = a{a*{j)). Note that s' ^ X: s' ^ s since cr*{j) ^ cap(s), and s' ^ T{s) since 
Us"eT(s) cap(s") = 0. The change in assignment cost for each such client j is at most 2c* by Lemma 3.2 
Thus the change in total assignment cost is at most Ylj£D*(o)('^j ~ ^i) + YlijeD{X)\D*{o) '^^j- Combimng 
this with the bound on the movement-cost change proves the lemma. ■ 

We now define a key ingredient of our analysis, called an interval-swap operation, that is used to bound 
the movement cost of the Si- and 5o-locations and the assignment cost of the clients they serve. (We build 
upon this in Section [4] to give a tighter analysis proving a 3-approximation.) Let S' = {s'l, . . . , s'^} C 
5o U 5"!, r < be a subset of at most locations on a path or cycle ZinVuC, where s'^_^^ is the next 
location in {Sq U 5i) n Z after s^. Let O' = {o[, . . . , o^} C O where Og_i = center(sg) for q = 2, . . . ,r 
and o[. is an arbitrary location that appears after (and before s'^) on the corresponding path or cycle. 
Consider each s'^. If |cap(Sq)| > t, choose a random path P € Vc{Sq) with probability {s')\ ' 
Xg = {start(P)} and Yg = {o^}. If |cap(s;)| < t, set Xg = {s'g} U T{s'g), and Yg = {o'J U H{s'g). Set 
X = Ug=i Xg and Y = Ug=i Yg. Note that |X| = |y| < since \Xg\ = \Yg\ < t for every q = l,...,r. 
Notice that X is a random set, but Y = O' U H{S' n 5*3) is deterministic. To avoid cumbersome notation, 
we use swap(X, Y) to refer to the distribution of swap-moves that results by the random choices above, and 
call this the interval swap corresponding to S' and O' . We bound the expected change in cost due to this 
move below. Let l(s) be the indicator function that is 1 if s G and otherwise. 

Lemma 3.5 Let S' = {s'l, - ■ ■ , s'^} C 5oU5i, r <t^ andO' be as given above. Leto^ := center(s'i) = o-, 
where Oq = nil and D*{o'q) = if s'^ G Sq. Consider the interval swap swap(X = \Sg=i Xq-,Y = 
Ug=i ^q) corresponding to S' and O' , as defined above. We have 

r 

0< E[MFL((5\X)uy)-MFL(5)] < 5^shift(4,o;)+ 2/*+ ^S* " 



_ 0/ 

P&Vc(S'),i&P j&D*{0') 

2c* 



+ E 2c*+ J2 ^ + E (/r + /. + s* 

ieD(T(S'nS3)U(5'n53)) jeD(T(5'\53)) jeD'io'g) 



Proof : Let Z be the path in V or cycle in C such that S' U O' C Z. 

We first bound the increase in movement cost. The interval swap can be viewed as a collection of 
simultaneous swap(Xg, Yg), q = 1, . . . , r moves. If Xg = {start(P)} for a random path P G 'Pc{s'g), 
the movement-cost increase can be broken into two parts. We do a shift along P, but move the last initial 
location on P to s'g, and then do shift on Z from s'g to o'g. So the expected movement-cost change is at most 

— |— Y (shift(P) + c(end(P), 4))+ shifty, o;)<—|— 2/* + shift^, o'^) 

which is at most ^PgP J 2/* +shift(Sg, Og). Similarly, if | cap (Sg)| < t, we can break the movement- 
cost increase into shift(P) < X^jgp 2/* for all P G Vc{s'g) and shift(Sg, o'g). Thus, the total increase in 
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movement cost is at most 

r 

j;shift(4,o;)+ 2/;. (4) 

9=1 P£VciS'),i£P 

Next, we bound the change in assignment cost by reassigning chents in D = D*{0') U D{X) as 
follows. We assign each client j G D*{0') to a*{j). If |cap(s'^)| > t, then s'^ ^ X. For every client 
j £ D \ {D*{0')), observe that either a*{j) G y or a{a*{j)) ^ X. To see this, let o = a*{j) and 
s = a{o). Ifo^Y then s ^ S' n S3; also s ^ T(S"), and so s ^ X. So we assign j to a*{j) if cr*(j) G y 



and to a{a* (j)) otherwise; the change in assignment cost of j is at most 2c* (Lemma 3.2 1. 

Now suppose |cap(s'^)| < t, so s[ G X. For each j £ D \ {D*{0') U D*(Oo)), we again have 
cr*(j) G y or cr((7*(j)) ^ X, and we assign j to (T*(j) if cr*(j) G Y and to (T(cr*(j)) otherwise. We assign 
every j G -D n D*{o'q) to (recall that Oq = oj), and overestimate the resulting change in assignment cost 
by EjeD*(o^)(Cj + fi + fd- Finally, note that we reassign a client j G L»(r(5' \ ^3)) \ D*{0') with 
probability at most j (since cr(j) G X with probability at most j). So taking into account all cases, we can 
bound the change in total assignment cost by 



E E 2c* + ¥ + 1(4) E (5) 

ieD*(0') ien(T(5'n53)u(5'n53)) ieD(T{5'\53)) ieZ)*(oo) 

In we are double-counting clients in D(T{S') U (5" U S'3)) n D*{0'). We are also overestimating the 
change in assignment cost of a client j G D{X) n D*(oo) since we include both the l{si){c* + /* + /j) 

2c* 

term, and the 2c*j or terms. Adding Q and Q yields the lemma. ■ 



Notice that Lemma 3.4 immediately translates to a bound on the assignment cost of the clients in 



D* {center (s)) for s G <S'2. In contrast, it is quite unclear how Lemma 3.5 may be useful, since the expres- 
sion I]je-D*(o[,)(/f + fi) in the RHS of ^ may be as large as + /j) (but no more since |L'*(oq)| < t 
if l{si) = 1) and it is unclear how to cancel the contribution of f~- on the RHS. One of the novelties of 
our analysis is that we show how to amortize such expensive terms and make their contribution negligible 
by considering multiple interval swaps. We cover each path or cycle Z in different ways using intervals 
comprising consecutive locations from SqVJ Si. We then argue that averaging, over these covering ways, 
the inequalities obtained from the corresponding interval swaps yields (among other things) a good bound 
on the movement-cost of the (S'o U ^i) -locations on Z and the assignment cost of the clients they serve. 

Lemma 3.6 Let Z £ VUC, S' = {s'l, . . . , s[.} = SiCi Z, where s'g_^_l is the next Si-location on Z after s'^, 
and O' = {center(s']^), . . . , center(s^)}. Let o'^ = end(Z) if Z £V and center(s'^) otherwise. For r > t^, 



< E(^-/*-¥/0+ E 2/;+ + 

i'^z P(zp^(s'),ieP jeD*{zno) 



t 

ieD*(0'u{o;,}) jeDiTiznSsMznSs)) ieZ)(T(5'\S3)) 



Proof : We first define formally an interval of (at most) t"^ consecutive {Sq U 5i) locations along Z. As 
before, let o'^_-^ = center(sg) for g = 1, . . . , r. For a path Z, define s'^ = start(Z) for g < and s'^ = nil 
for q > r. Also define = Oq for g < and o'^ = end(Z) for q > r. If Z is a cycle, we let our indices 
wrap around and be mod r, i.e., s'g = s'^ j^^d r' "'g = '^'q mod r fo"" 9 ^'r = o'o = center{s[)). 

For 1 — < h < r, define 5^ = {s'^, s'j^j^^, ■ ■ ■ , s'f^J^^2_l] to be an interval of length at most on 
Z. Define = {o'/j, o'/^+i, • • ■ , o'/i+f2_i}- Note that we have 1 < = |0^| < if Z is a path, and 
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= |0^| = if Z is a cycle. Consider the collection of intervals, {S'_^2^i, S'_^2^2^ ' ' ' ' '^rl- ^'^^ esich 
S'j^, O'f^, where -f + 1 <h<r, we consider the interval swap {X^, Yh) con^esponding to S'i^,0'f^. We add 
the inequalities ^ xQ for all such h. Since each s' G 5' U {s'q} participates in exactly such inequalities, 
and each s'^ G S' is the start of only the interval 5*^, we obtain the following. 

T 

g=o PeVc{S'),ieP 

jeD*(0'u{o;})) jeD(T(zn53)u(znS3)) jeD{T{S'\Sa)) 

+ 5] iKo.))-^- 5] (/; + /. + c*). 

j:(7(o,)ez jeZ)*(o,) 
Notice that the S-locations other than s'^ on the s'^ o'g sub-paths of Z lie in 5*2, and for each i such 
that a{oi) G Z n we have c(o.j, a{oi)) < J2j^D*{oi) Thus, using ([T]), we have 

^shift(4,o;) = ^(/;-/,)+ Yl c{o,,a{o,))<Yif:-n)+ E 

9=0 iez j:(T(oi)eznS2 iez jeD*(znO) 

Since l(cr(o)) = 1 means that cr(o) G ^3, and so |L'*(o)| < t, we have 

E E ^^sE(^+^^)sE^+ E |-(«) 

i:cr(oi)eznS3ie-D*(oi) iez iez jeD*(znO) 

Incorporating ([8]l and (|9]) in ([7]), and simplifying yields the desired inequality. ■ 

For a path or cycle Z where IS*! n Z| < t^, we obtain an inequality similar to ([6]l. Since we can now 
cover Z with a single interval, we never have a client j such that none of o"(j), cr*(j), a{a*{j)) are in our 

new set of final locations. So the resulting inequality does not have any ' ^ ' + ^ terms. 

Lemma 3.7 Let Z £ VUC, S' = {s[, . . . , s[.} = SiCi Z, where s'^^i is the next Si-location on Z after s'g, 
and O' = {center(s'^), . . . , center(s^)}. Let o'^ = end(Z) if Z and center(s'^) otherwise. For r' < t^, 



^<Y.^n-fi)+ 2/*+ Y 



c,+c* 



t 

P(z'p^(^s'),ieP jeD*{zno) ^^^^ 

+ E E 2c* + Y 

jeD*{0'u{o',}) jGDiTiznSsMznSs)) jGD{T{S'\S3)) 



Proof : The proof is similar to that of Lemma 3.6 except that since we can cover Z with a single interval, 
we only need to consider a single (multi-location) swap. We consider two cases for clarity. 

1. Z is a path, or r > 0. As before, let o'^^^ = center(s'^) for g = 1, . . . ,r. If Z is a path, define 
s'q = start(Z). If Z is a cycle, we again set s'^ = s'^ mod r' ~ '^'q mod r ^'^^ 1- Consider the interval 
swap {X, Y) corresponding to S' U {s'q}, O' U {o^}. The inequality generated by this is similar to ([3]) 
except that we do not have any (/* + /• + c*) terms since for client j G D{X) U D* (Y), we always have 
that either a*{j) G 1" or a{a*{j)) ^ X. Thus, ([3]) translates to the following. 

o<^shift(4,o;)+ Y 2/;+ Y E 2c*+ Y ¥ 

9=0 PeVc(S'),ieP jeD*{0'u{o'^}) jeD{T{znS3MznS3)) j£D(T{S'\S3)) 
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Substituting E5=o shift(4, o'^) < Yliezif* - fi) + T,jeD*{znO) in ^ yields the stated in- 

equality. 



2. Z is a cycle with r = 0. Here, Lemma 



3.3 



yields < J2^ezif! - h) + T.jeD*{znO) ^ (which is 



the special case of the earlier inequality with Sq = nil = o'^, S' = O' = Z f] S3 = 9). 



Proof of Theorem 3.1 : We consider the following set of swaps. 



Al. For every s G S2, the move swap({s} U T(s), cap(s)). 



A2. For every path or cycle Z with | Zn Si | > t^, the ^^-weighted interval swaps as defined in Lemma 

f2 



3.6 



A3. For every path or cycle Z with \ Z n Si\ < t , the interval swap defined in Lemma 3.7 

Notice that every location o G O is swapped in to an extent of at least 1 and at most 2. (By "extent" 
we mean the total weight of the inequalities involving o.) To see this, suppose first o = end{Z) for some 
path Z, then o is involved to an extent of 1 in the interval swaps for Z in A2 or A3. In this case, we say 
that the interval swap for Z is responsible for o. Additionally, if s = a{o) G 5*2, then o belongs to the 
multi-swap for s in Al, else if s G 5*3 then o is part of the interval swap for the path/cycle containing s in 
A2 or A3. Now suppose o = center(s). If s G 52, then o is included in the multi-swap for s in Al. We say 
that this multi-swap is responsible for o. If s G , then o is included in the interval swap for the path/cycle 
containing s in A2 or A3, and we say that this interval swap is responsible for o. 

Consider the compound inequality obtained by summing ([2]), ([6]), and ( fTO) ) corresponding to the moves 
considered in Al, A2, and A3 respectively. The LHS of this inequality is 0. We now need to do some 
bookkeeping to bound the coefficients of the /*, fi, c*,Cj terms on the RHS. We ignore o(l) coefficients 
like J, |j in this bookkeeping, since for a given {/*, fi, c*,Cj} term, such coefficients appear in only a 
constant number of inequalities, so they have an o(l) effect overall. Let F and C denote respectively the 
movement- and assignment- cost of the local optimum. 

Contribution from the c* and cj terms. First, observe that for each o G O, we have labeled exactly one 
move involving o as being responsible for it. Consider a client j G D{s) n D*{o). Observe that c* or Cj 
terms appear (with a B(l) -coefficient) in an inequality generated by a move if (i) j is reassigned because the 
move is responsible for o; or (ii) s is swapped out (to an extent of 1) by the move (so this excludes the case 
where s G T{s'), s' G 5i \ ^3 and the move is the interval swap for the path containing s'). If (i) applies, 
then the inequality generates the term (c* — cj). If (ii) applies then the term 2c* appears in the inequality. 
Finally, note that there are at most two inequalities for which (ii) applies: 

- If s = start(Z) G Sq, then (ii) applies for the interval-swap move for Z. If s' = a{end{Z)) G 52 U ^3, 
then (ii) again applies, for the multi-swap move for s' if s' G 52, or for the interval swap for the path 
containing s' if s' G 53. 

- If s G 5i n 53, then (ii) applies for the interval swap for the path containing s. 

- If s G 52, then (ii) applies for the multi-swap move for s. 

So overall, we get a 5c* — cj contribution to the RHS, the bottleneck being the two inequalities for which 
(ii) applies when s G start(Z) and cr(end(Z)) G 52 U 53. 

Contribution from the /* and fi terms. For every i G the expression (/* — fi) is counted once in 
the RHS of the inequality ^ or ( fTO] ) for the unique path or cycle Z containing i. The total contribution 
of all these terms is therefore, F* — F. The remaining contribution comes from expressions of the form 
J^peVcis) ieP "^fi '^^ RHS of Q, ([6]l, and ( fTO) ). The paths P involved in these expressions come from 



■Pc(52) U (UzGPuc ^c(^ n 53)) C V. Therefore, the total contribution of these terms is at most 2F*. 
Thus, we obtain the compound inequality 

0< (5 + o(l))C7* + (3 + o(l))F*-(l-o(l))(F + C) (11) 
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where the o(l) terms are 0{\) = 0{p-^/^). This shows that F + C < (3 + o{l))F* + (5 + o{l))C*. ■ 



Remark 3.8 A subtle point to note is that in the above analysis: (1) we consider only a polynomial number 
of swap moves (since there are at most 0{nP) swap moves available at any point), and (2) we place a 
constant weight (of at most 1) on the inequality obtained from any given swap move (when we take the 
weighted sum of the inequalities obtained from the various swap moves). This is relevant because we can 
only ensure in polynomial time that we terminate at an approximate local optimum. More precisely, for 
any polynomial f{n) and an e > 0, we obtain in polynomial time a solution with cost F + C such that the 
change in cost due to any local move is at least —jf^ ■ {F + C) (instead of 0). But this means that the LHS 
of (V\_) is now —jfn) ' (-^ ^)-^' where N is the total weight placed on the inequalities generated from the 
various swap moves whose suitable linear combination yields ( [TT| ). Therefore, since < /(n), this only 
results in a (1 + e)-loss in approximation factor. 



4 Improved analysis leading to a 3-approximation 

In this section, we improve the analysis from Section |3] Specifically, we prove the following theorem. 

Theorem 4.1 The cost of a locally-optimal solution using p swaps is at most 3 + (^\J ^°iog^ ^ times the 
optimum solution cost. 

To gain some intuition behind this tighter analysis, note that the only reason we lost a factor of 5 in the 
previous analysis was because there could be locations s = start (Z) € So that are swapped out to an extent 
of 2; hence, there could be clients j G D{s) for which we "pay" 2c* each time s is swapped out, and also 
pay an additional c* — cj term when a* {j) is swapped in. To improve the analysis, we will consider a set of 
test swaps that swap out each location in S to an extent of 1 + o(l). 

The aforementioned bad case happens only when s' = cj(end(Z)) £ S2U S-^, because when we close 
(i.e., swap out) s' as part of an interval swap or a multi-swap, we open (i.e., swap in) all the locations in 
H{s'), and we achieve this via path swaps (i.e., shift moves) along paths in Vc{s') that swap out locations 
in T(s') (for a second time). The main idea behind our refined analysis is to not perform such path swaps, 
but instead to "recursively" start an interval swap on each path in Vc{s'). Of course, we cannot cany out 
this recursion to arbitrary depth (since we can only swap a bounded number of locations), so we terminate 
the recursion at a depth of i^. So, whereas an interval swap included at most 5"! -locations on the main 
path or cycle Z, we now consider a "subtree" swap obtained by aggregating interval swaps on the paths in 
ljVc{Z n S's). A subtree swap can be viewed as a bounded-depth recursion tree where each leaf to root 
path encounters at most locations in Si. Because we no longer initiate path swaps for S'a-locations, a 
leaf location s" G ^3 in this recursion tree will not have any locations in cap(s") opened. But we will 
slightly redefine the Si, S2, S-^ sets to ensure that |D*(cap(s"))| < t, and use the same trick that we did 
with interval swaps in Section [3} we average over different sets of subtree swaps (like we did with interval 
swaps in Section [sjl to ensure that s" is a leaf location with probability at most p. This ensures that we 
incur, to an extent of at most j, the cost /* + fi + c{oi, s"), where Oj = center(s"), for moving j with 
a{a*{j)) = s" from s" to Si. 

Notation. Let t be an integer such that p > t^t^^ + 1. We prove that the local-search algorithm has 
approximation ratio 3 + 0{t^^). We redefine So, 5*1, ^2 and ^3 as follows. 

• So = {seS: |cap(s)| = 0}. 

• Si = {se S\So: |Z)*(cap(s))| < t or |cap(s)| > t]. 
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• S2 = {seS: |D*(cap(s))| > t, |cap(s)| < t}. 

• S'3 = SoU{s G Si : |cap(s)| < t}. 

Clearly, S = SqU SiU S2. We also redefine center(s) to be the location in cap(s) closest to s. 

Claim 4.2 Let s e S2 and o = center(s). Then c{s, o) < | Ylj^D*{cap{s))i'^j + ^j)- 

Proof : We have c{s,o) < c{s,o') for any 0' G cap(s), and c{s,o') < cj + Cj for any j G D*{o'). 
Therefore, c{s,o) < cj + Cj for any j G D*{cap{s)), and the claim follows since |D*(cap(s))| > t as 
se S2. m 




these edges are deleted when moving from H to H* O * ^ '^1 



It will be more convenient to work with the digraph H = {F, E) obtained from G by contracting each 
triple {sj, i, o^} of nodes associated with a facility i into a single node that we also denote by i. Thus, {i, i') 
is an arc in £^ if a{oi) = Si' (it may be that i = i'). Note that H may have self loops, and each node in H has 
outdegree exactly 1 (counting self-loops) so each component of H looks like a tree with all edges oriented 
toward the root, except the root is in fact a directed cycle (possibly a self-loop). The figure above illustrates 
this graph and some of the subgraphs and structures discussed below. 

For brevity, we say that an edge {i, i') in is a center edge if Oj = center(si/). In the arc set E' = 
G E : Oi = center(sj/)}), each node has indegree and outdegree at most 1, so E' consists of a 
collection node-disjoint paths V and cycles C. For a facility i G V, let V{i) denote the unique path in V 
containing i. Let start(z) and end(i) denote the start and end facilities of V{i) respectively. For distinct 
facilities i, i' with s-i, Sii G ^o, the paths V{i) and 'P(i') are clearly vertex disjoint. 

Now define H* = {F,E' U {{i,i') : Sj/ G S-i,a{oi) = Sj/}); that is, H* is the subgraph of H with 
node-set F and edges (i, i') of E where (i, i') is a center edge or Sj' G 53. Call a node i of H* a root if i 
has no outgoing arc or i lies on a directed cycle in H*. 

We consider an integer 1 < I < and describe a set of swaps for each index /. The inequalities for the 
swaps for different / will be averaged in the final analysis. We obtain Hf by deleting the edges (i, i') of H* 
where: i is not on a cycle, s^i G ^i, and the number of facilities i" with Sj" G on the path between i' 
and the root of its component in H* (both included) is I mod t'^. We define a subtree of to be an acyclic 
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component of , or a component that results by deleting the edges of the cycle contained in a component 

For a facility i, define cand(i) = {i' : Oi' G cap(sj) \ {center(si)}, i' path in H*}. Note that 

|cand(i)| > |cap(sj)| — 2. The reason we define cand(i) is that we will sometimes perform a shift along 
some path Z G Vc{si) to reassign the facilities on Z but we will not want this to interfere with the operations 
in the subtree of containing i. For a facility i with Si 5*2, let next(i) be the facility obtained as follows. 
Follow the unique walk from iin H using only center edges until the walk reaches a node i' with either no 
outgoing center edge, or the unique {i' , i") center edge satisfies Sj" G Si; we set next(2) = i' . 

Claim 4.3 The number of facilities i with Sj G 5o U Si in any subtree of is at most i*^. 

Proof : The facilities i in such a subtree that are in 52 have indegree and outdegree at most 1. Shortcutting 
past these facilities yields a tree with depth at most and branching factor at most t. ■ 



The test swaps. For a subtree T of , we describe a set of nodes Xt to be swapped out and a set of 
nodes Yt to be swapped in with \Xt\ = I^tI < We do not actually perform these swaps yet to generate 
the inequalities since we will have to combine some of these swaps for various components. 

For each f G T with Sj G 5*0 U 5i, we add the following location in S to Xt'- if Si G 5*3 we add Sj to 
Xt', otherwise (so Sj G Si \ S^), we choose any single i' G cand(z) uniformly at random and add Sstart(j') 
to Xt- We also add Onext(i) to Yt- 

When we say perform swa.'p{XT ,Yt) , we specifically mean the following reassignment of facilities. 
For Si G Xt with Si G S^, we perform shift(si, Onext(j))- For Si G Xt with Sj G 5i \ S^, say i' is the 
facility in cand(i) for which Ssta|.t(i') was added to Xt- Then we perform shift(s5tart(i')i move facility 
i' from Oil to si, and finally perform shift(sj, Oneyx{i))- As always, each client is then assigned to its nearest 



final location. Lemma 4.4 implies that these shift operations charge different portions of the local and global 
optimum. 

Lemma 4.4 For a subtree T, all of the shift operations described for swap(X'jr', Yt) have their associated 
paths being vertex disjoint. 



Proof : For any subtree T, the paths between Sj and Onext(j) for the facilities i G T with Si £ SqL) Si are 
vertex-disjoint by definition of next(i). Also, for any two distinct ii,i2 G T, and any i G cand(ii), i' G 
cand(z2), we have start(i) ^ start(i'), and so their associated paths V{i) and V{i') are also vertex-disjoint. 

Finally, consider any i G T with Si £ Sq U Si, and i" G T (i" could be i) with s," G Si \ Ss. Consider 
the paths involved in swap(sj, Onext(i)) and swap(sstart(j') ) Oj'), where i' G cand(i"). Note that both of these 
paths consist of only center edges. Therefore, since each facility has at most one incoming and one outgoing 
center edge, and i' = end(i'), if these paths are not vertex-disjoint, then it must be that the path involved 
in swap(sj, Onext(t)) ^ subpath of the path involved in swap(sstart{j')' '^i')- This means that i and i', and 
hence, i, i' , i" , are all in the same component of H* . Also, the edge (i', i") is not in H* so i' is the root of 
its component in H* . But then there is a path from i" to i' , which contradicts that i' G cand(i"). ■ 

We need to coordinate the swaps for the various subtrees of . Consider a component Z in H* . Let 
C = if Z is rooted at a node, otherwise let C be its cycle of root nodes. Let zi, . . . , be the facilities on 
C with Sj G Si, indexed by order of appearance on C starting from an arbitrary facility ii on C (A; = if 
C = 0). We consider four kinds of swaps. 

Type 1. If 1 < /c < t^, simultaneously do SYja.'\){XT ,Yt) for all subtrees T rooted at some i £ C with 
Si G Si. 
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Type 2. Otherwise, if A: > t^, define /;/ = {i;/, . . . , i;/+t2_i} for all = 1, . . . , A; (where the indices 
are mod k). Simultaneously perform swap(Xr, It) for each subtree T rooted at a facility in Reasoning 



similarly as in Lemma 4.4 and noting that the subtrees involved in a single type-1 or type-2 swap are all 
disjoint, we can see that all shift paths involved in a single type- 1 or type-2 swap are vertex-disjoint. 

Type 3. For each i with si G ^2, simultaneously perform swap(XT, Yt) for all subtrees T rooted at some 
i' with Oil G cap(sj) \ {center(sj)}. At the same time, we also swap out Sj and swap in Oj// = center(sj) 
for a total of at most t* "^^ + 1 < p swaps. It may be that some (at most one) shift path in this swap includes 
Si, but then we just move i" to Oj// instead of Si, and then move i according to the shift operation. 

Type 4. Finally, for every other subtree T of that was not swapped in the previous cases, perform 
swap(Xj', Yt) on its own. 

Analysis. We first bound the net client-assignment cost increase for any single one of these test swaps. 
So, fix one such swap, let {Tr}^^i, k < he the set of subtrees involved in the swap, and let B denote the 
set of facilities i such that Oj = center{a{oi)) and a{oi) is closed during the swap while Oj is not opened. 
So B consists of facilities with a center edge to some leaf of some subtree or, if the swap is of type 2, to 
the start of an interval /;/. For this swap, let Ci = {j £ V : (J*{j) is opened}, C2 = D*{{oi : i G B}), and 
C3 = {j ■■ cr{j) = Sie So and end(i) G U,. Ui'eT, cand(i')}- 

Lemma 4.5 The expected change in client-assignment cost for a test swap is at most ^j(^(j^{c* — Cj) + 

X^jgCj 2c* + j3Y X^j'eCa ^'-j"'"^^ X^ieB (A* ~^ /«)• the expectation is over the random choices involved 

in selecting facilities from the appropriate cand(.) sets. 

Proof : After the swap, we move every j G Ci from cr(j) to (T*{j) for a cost change of c* — Cj. Every 
client in j G C2 U C3 for which a{j) is closed is moved initially to a{a*{j)) for a cost increase of at most 
2c*. 

Suppose i is such that (j{oi) = o'{a*{j)) and Oj = center {a (oi)). It may be that a{oi) is still not open 
which means that i ^ B. Note that either Si or Oi is opened after the shift and we move every client that was 
moved to a{oi) to Si or Oj (whichever is open). This extra distance moved is at most f* + fi + c{oi, o"(oi)) < 
2/* + 2fi. Note that i ^ B implies that cr(oi) G S3, otherwise a{oi) would not have been closed down in 
the swap. So \D*{cap{a{oi)))\ < t, by definition of 5*3, and at most t clients will be moved to either Si or 
Oi in this manner. 

Finally, we note that while j G C3 may have a{j) being closed, this only happens with probability at 
most m 

Now, we consider the following weightings of the swaps. First, for a particular index 1 < Z < we 
perform all type 1, 3, and 4 swaps. For a component of Hf containing a cycle C, we perform all type-2 
swaps for the various intervals for C and weight the client and facility cost change by ^ . Finally, these 
weighted bounds on the client and facility cost change are averaged over all 1 < Z < t^. 

Lemma 4.6 The expected change in client-assignment cost under the weighting described above, is at most 

3c* - c, + o (i) (E,(/* + h) + c*). 



Proof : For a fixed I, every client j is in Ci as in Lemma 4.5 to an extent of 1; either once in a type 1, 3, 
or 4 swap or exactly times among the type-2 swaps, each of which is counted with weight ^. Similarly, 
every client j is in C2 to an extent of at most 1 and is in C3 to an extent of at most 1 over all swaps for this 
fixed I. Finally, we note each facility i on a cycle in H* lies in the set B for at most one offset 1 < I' < k 
for that cycle, so its contribution 2t{f* + fi) to the bound is only counted with weight -p for this fixed /. 
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Lastly, every facility i not on a cycle in H* lies in B for at most one index 1,1 < I < t"^ and, then, 
in only one swap for that particular /. Since we average the bound over all indices / between 1 and t'^, the 
contribution 2t{f* + fi) of such i is counted with weight only ^. ■ 

Next we bound the expected facility movement cost change. Let F' be the set of facilities i that do not 
lie on a cycle in H* consisting solely of facilities i' with Sj' G 82- 

Lemma 4.7 The expected change in movement cost (under the weighting described above) is at most 

E^eF' if: -h) + o (i) (E, /; + + c,)). 

Proof : We consider two cases for a facility i. First, suppose Si £ 5o U 5i. Then when Si is swapped out in 
a subtree during the shift from Sj to Onext(i)> ^ is moved to either Oj, if i = next(i), or to a{oi), if i / next(i). 
The latter case implies that cj(oj) G 82- The total movement change is at most /* — fi if i is moved to Oj 
and is at m ost /* — fi + c{oi, <j{oi)) if i is moved to a{oi). Since cr(oj) G 52 and Oj = center(cr(oj)), by 
Claim |43|we have that c(oi, a{oi)) < ^ EieD-(cap(a(o.)))(Ci + ^i)" 

The only other time i is moved is when end(i) is randomly chosen from cand(i') for some facility i'. But 
this happens with probability at most In this case, i is shifted from Si to a{oi). We do not necessarily 
have cj(oj) G 82 in this case, but we can use the bound c{oi,a{oi)) < f* + fi to bound the expected 

2f* 

movement-cost change for i in this case to be at most Overall, the expected movement-cost change for 
i is at most 

iGD*(cap(fj(o,))) 

Next, we consider the case Si G 52- Let center(si) = Oj/. When the swap consisting of i and all subtrees 
rooted at cap(sj) \ {0^/} is performed, i is moved from Si to Oj' unless i lies on a shift path during that swap, 
in which case it is moved like in the shift. Since Si G 82, we have c{si, Oj/) < j J2jeD*{cap{s,))i'^j + ^j)- 
Unless i lies on a cycle with no 5i -locations, that is, i ^ F', i lies between i" and next(i") for exactly one 
i", and shift(sj//, Onext(i")) is performed to an extent of 1; this holds even if Si lies on a shift path during the 
corresponding type-3 swap involving i. All other times i when is moved, it is due to the same reasons as in 
the previous case, so the total change in movement cost for facility i is at most 

jGD'(cap{s,)) ieD*(cap{a(oO)) 

Adding up the appropriate expression for each facility accounts for the expected change in total move- 
ment cost. ■ 



Proof of Theorem 4.1 : By local optimality, the change in total cost for every test swap (counting every 



random choice) is nonnegative. By averaging over the various swaps, the expected change in total cost is 



nonnegative, so the sum of the expressions in Lemmas 4.6 and 4.7 is nonnegative. To generate an inequality 



involving a — fi term for facilities i ^ F', we sum the bound given by Lemma 3.3 here over all cycles of 



H* involving only facilities i with Si G 5*2. This yields < J2i^F' {~fi + fi + c(oi, cr(oj))), and we can 
bound c{oi,a{oi)) by j J2jeD*{cap{a{oi)))i^j + '^i)- Adding this to the inequality that the expected change 
in total cost is nonnegative gives (l - O (|)) (C + F) < (3 + O (|)) C* + (l + O {])) F*. m 



15 



5 Extension to the weighted case 



The analysis in Section [4] (as also the proof of the 5 approximation) extends easily to the weighted gener- 
alization, wherein each facility i has a weight Wi > and the cost of moving z to s is given by Wic{i, s), 
to yield the same (3 + o(l)) -approximation guarantee. With the exception of one small difference in the 
analysis, this requires only minor changes in the arguments. We discuss these briefly in this section. 

Unless otherwise stated, the same notation from Section|4]is used in this section. We emphasize that /* 
and fi now refer to the weighted movement cost of facility i in the global or local optimum, respectively. 
So, /* = Wi ■ c{i, Oi) and fi = Wi ■ c{i, Sj). 

One difference in notation is that the definition of is slightly revised to this weighted setting: Sj G Si 
if |cap(sj)| > t, or < |cap(si)| < t and |D*(cap(si))| < maxjwj , Wj/ } • t, where i' is such that 
Oil = center(sj) (equivalently, {i' ,i) is a center edge in H). If all facility weights are 1, then this def- 
inition of Si agrees with the definition in Section |4] Similarly, we say Si € 5*2 if |cap(si)| < t and 



|D*(cap(sj))| > maxjwj, Wj'} • t. Under these definitions, similar to Claim 4.2 we now have that 

Wi ■ c{si,Oii) < iEjeD*(cap(s>))('^i + ^i) ^^^"'^^ c{oi,a{oi)) < c* + Cj for any j G D*{cap{si)) as 
before, and |D*(cap(sj))| > Wit). 

We consider the same set of test swaps and the same averaging of the inequalities generated by these 
swaps. When a test swap is performed, we consider the same shift and reassignment of facilities to generate 
the inequalities. In most cases, we also move the clients in the same way as before with the exception that if 
a client j has all of a{j),(j*{j) and a{a*{j)) being closed, then we do not necessarily send it to Si where i 



is such that Oj = (T*{j). This is discussed in Lemma 5.1 
As in the discussion before Lemma 



4.5 



we consider a swap involving subtrees {Tr}^^i. Let B be as 
before, and let B' be the set of facilities i such that i is a leaf in some Tr or, if the swap is a type-2 swap, 
that i is the first facility in /;/. Note that i £ B if and only if the unique arc in H* is a center arc with 
i' € B'. We let Ci, C2, and C3 also be defined as in Section |4] 

Lemma 5.1 The expected change in client assignment cost for a test swap is at most Xlj eCi ('-j ~ '^i) + 



Proof : Consider one particular swap. As in the proof of Lemma 4.5 we move j G Ci to (T*{j) and 
i € C2 U C3 to a{a* (j)) and bound their assignment cost change in the same way. As before, it may be that 
for some of these clients j G C2 U C3 we have that a{a*{j)) was closed in the swap. For such clients, we 



do the following slight variant of the reassignment that was done in the proof of Lemma 4.5 

Suppose (i, i') is the center edge such that a*{j) = Oi for a client j £ C2U C3, and Sj' is not open. If 
Wi > Wi', then we send j to either Si or Oj. As in the proof of Lemma |43| one of these must be open and the 
total cost of moving j from Sj' to either Si or Oj is at most 2c(z, Sj) + 2c{i, Oj). Otherwise, if Wi' > Wi then 
we send j to either Oj' or cr(oj') (one of them must be open). The distance from Sj' to either Oj/ or fT(oj') is 
bounded by 2c{i\ Si') + 2c(i', Oj'). 

We conclude by noting that each facility i £ B has at most w- • t clients sent to either s~- or o- from 
cr(Oj-) in the manner just described, because o"(oj) must be in S3. Similarly, each i £ B' has at most w~- ■ t 
clients sent to either or cr(o-) from s-- in the manner described above, since G 5*3. So, the total client 
movement charged to i £ B U B' this way is at most 4:twi{c{i, Si) + c(i, Oj)) = Atf* + 4t/j. ■ 

Using the same weighting of the swaps as in Section [4] we get the following bound on the contribution 
of the client movement cost changes over these swaps. The proof is nearly identical, except we notice that 
a facility i' lies in the B'-set for various swaps to an extent of at most (under this weighting), since the 
facility i such that {i, i') is a center edge lies in some S-set to an extent of at most ^. 
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Lemma 5.2 The expected total client assignment cost change, weighted in the described manner, is at most 

3c* -c,+0 (1) + /.) + c*). 

The contribution of the faciUty movement costs is bounded in essentially the same way as in Lemma 



4.7 We just provide the details on how to account for the weights of the facilities. As before, let F' be the 



set of facilities i that do not lie on a cycle in H* consisting solely of facilities i' with Sj/ G 82- 

Lemma 5.3 The expected change in movement cost is at most YliieF' ifi ~ fi) ~^ O ( j) {J2i f* + + 

Proof : When shift(s, 0) is performed, we move facilities i from Si to center(oj). If this shift was performed 
during a path swap, then the movement-cost change for a facility i moved in this shift is at most Wic{i, Oi) + 
Wic{oi, cr{oi)) — Wic{i, Sj) < 2wic{i, Oi) = 2/* so the same bound used before applies. 

If such a shift was performed along a path in a subtree, then we did not want to bound c(oj, cr{oi)) by 
c(i, Si) + c(z, Oi) because we do not want to cancel the contribution of — c(i, Si) to the bound. However, this 
only happened when (T(oi) G 52 so we can use the fact that |D*(cap(cr(oi)))| is large and that c(oi, (T(oi)) < 
c* + Cj for any j G D*(cap(cr(oj))). In our setting, as noted earlier, the movement cost Wi ■ c{oi, o"(oj)) 

can be bounded by j J2jeD*{cap{a{o^))) ('^j + ^j)' which is the same upper bound we used in the unweighted 
case. 

Finally, the only other time we moved a facility was from some Sj G 5*2 to center(si). The cost of this 
move is now Wi ■ c{si, center(si)) which can also be bounded by j EjeD'(cap(si))('^i + '^j) using the same 
argument as in the previous paragraph. So, all bounds for the unweighted faciUty movement cost increase 
averaged over the swaps also hold in the weighted case. ■ 

Finally, we remark that the same bound for the facility movement cost for facilities on a cycle with only 
152 facilities holds for the weighted case, again using arguments like in the proof of Lemma |5.3| to bound 



Wi ■ c{oi, cr{oi)). Thus, the proof of Theorem 4.1 is adapted to prove the following result for the weighted 
case. 



Theorem 5.4 The cost of a locally-optimal solution using p swaps is at most 3 + O ( < / ] times the 



logp 

optimum solution cost in weighted instances of mobile facility location. 



6 The single-swap case 

We show in this section that the local-search algorithm has a constant approximation guarantee also when 
p = 1 (which corresponds to t = 1 in Section |3]), even in the weighted setting. This requires a different 
analysis than in Section[3]since we now no longer have the luxury of amortizing the "expensive" terms in an 
interval swap via multi-location swaps. The approximation factor we obtain in the analysis below is large, 
but we emphasize that we have not sought to optimize this constant. Also, we remark that the analysis can 
be significantly simplified and improved in the unweighted setting. 

Recall that /* = Wi • c(i, Oj) and fi = Wi- c{i, Si). We use much of the same notation as in Section|3] The 
digraph G, its decomposition into paths V and cycles C, and the definition of center(s) are as in Sectionji] 
Thus, for a path P G we have a(end(P)) ^ P. Define Sq = {s £ S : |cap(s)| = 0}. Let Si £ S \ Sq 
with Oj/ = center(sj). We place in 5i if |D*(oj/)| < 1.5 max{wj, or |cap(sj)| > 1; otherwise we 
place Si in ^2. Also define S3 := SqU {s e Si : |cap(s)| < 1}. Let n* = \D*{o)\ for o G O. 

Lemma 6.1 Let Si G «S'2 and = center(sj), and consider swap(sj, o). We have 

0<MFL((S\{s,})U{o})-MFL(5)< ^ . c* - 1 • c,) + ^ 2c*. (12) 

ieD*{o) j(^D{s,)\D*{o) 
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Proof : We reassign clients in D*{o) to o, and each client j in D{si) \ D*{o) to o-{a*{j)) incurring a total 
assignment-cost change of J2jeD*{o)('^^ ~ "^i) + Ylj(^D{s)\D*{o) "^^'^ change in movement cost is at 
most 

WiC{0, Si)<Wi — ^ 3 + ) 

where the last inequality follows since Sj G 82- Adding this to the expression for the change in assignment 
cost yields the lemma. ■ 



In Lemmas 6.2 -6.5 we generate inequalities that will allow us to bound the total movement cost, and 
the total assignment cost of clients in S'o U 5i. We use the following notation for this sequence of lemmas. 
Given a path or cycle Z € "P U C, let S*^ = {s'^, . . . , s'^} = Si H Z, where s'g_^_l is the next 5i -location on 
Z after s'g. Let o'g_i = center{s'g) for g = 1, . . . , r, and O'^ = {oq, . . . , o'^^i}. Let s'g = Si^ and = o-^ 
for g = 0, . . . , r (see Fig.JTji. Let Zg denote the s'g ^ o'g subpath of Z. Consider an Oj — )• Si' —)•?'—)• Oj' 
subpath of Z. Let Az consist of all such Sj/ where Sj/ G 5' n ^3 and n*^ < 1.5wi. Let Bz consist of all 
such Sj/ where Si' £ S' H S3 and n* > l.Bwi. Note that if Sj' G Bz then Wi/ > Wi and n*. < l.Bwii. Let 
Cz consist of all such Sj/ where n* < 1.5wi. Clearly, Az ^ Cz ^ S'^ and Cz n = 0. When Z is clear 



from the context (as in Lemmas 6.2-6.5 1, we drop the subscript Z from S'^, O'^, Az, Bz, Cz- 
Q-.zns Q s O-.zno Q-.O' 




o- 





center(s' 1) 



Figure 1: The clear circles are locations in S; the shaded circles are locations in O. The clear and shaded 
circles with thick borders depict locations in S' and O' respectively. 



Lemma 6.2 Let Z £ V U C be such that r = \S'\ =0. Then, 



ifzevwith o<Y,if:-fi)+ E E E 2c*- (13) 

s'o = start(Z) iez ieD*((zno)\K}) iGD*K) ieD*(sj,) 

= end(Z) 



(14) 



Proof : First, suppose Z is a path. For every Oj G Z, Oj 7^ o[,, we have that o"(oj) G 6*2, and so we have 
Wi ■ c{oi,a{oi)) < J2jeD*{o,) i(cj + Cj)- Thus, the move swap(so,o[.) yields the following inequality, 
which implies ( [T3] ). 



o<E(/*-/^)+ E + E E 



2c*. 



i:(7{oi)£ZnS2 



For a cycle Z, analogous to Lemma 3.3 we have < X^iez {~fi + ft + ' ^(oi, cr(oj))) , and this coupled 
with the above bound onwi ■ c{oi, ajpi)) for i G Z yields ( [T?] ). ■ 
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Lemma 6.3 Let Z eV (with S', O', A, B, C as defined above). Let Sq = start(Z), and O" = O' U {o^ 
encl(Z)}. Suppose there is no q such that \Zq Ci J- \ = 1, s'g £ B, s'g_^i G C. Then, 



0< 5/* + j;(10.75/*-/i)+ 4.5c* + Y ^S* 

+ Y (6.5c* -c,)+ Y f(c.+S*)- 
jeD*{zno") jeD*{{zno)\o") 



(15) 



Proof : We derive ( [15] ) by taking a weighted combination of inequalities generated from various swap 
moves. We assume r > (otherwise ( [T3] ) implies ([15)). For a predicate R, let be the indicator 
function that is 1 if i? is true and otherwise. To keep notation simple, we follow the convention that a 
non-existent object (like o^_,_]^) is nil, and nil ^ T for every set T (e.g., o'^^^ ^ T for every T.) Also, 
L>(nil) = L>*(nil) = 0, and Wnw = 1. 

1) The first swap move is swap(so,o^), but we bound the change in cost slightly differently. We only 
reassign clients in D{s'q), obtaining the inequality 

0<^(/*-/.)+ Y w,-c{oi,a{oi))+ Y 2c-. (16) 

i&Z i(^Z:cr{Oi)(^Z j&D{s'f^) 

2n* 

Define Oj = 1 - 3;^. Note that ctj G [0, 1] if a{oi) e AuC. Consider each edge (oj, a{oi)) G Z. If 
a{oi) ^ A and Si ^ 13, we simply bound Wi ■ c{oi, o'(oi)) by fi + /*. Otherwise, if cj(oj) G C, we use the 
hound Wi-c{oi, a (oi)) < ai{fi + ft) + Y.jeD*{o,) i(cj + c*) which is valid since i(cj + c*) > 

2n* 

■ Wi ■ c{oi, (y{oi))- If cr(oi) ^ C and Si G -B, then n*. > \.f)Wi and we bound Wi ■ c{oi, cr(oi)) by 
J2jeD*{oi) i(ci + Cj). Incorporating these bounds in ( [T6] ), we obtain the following inequality. 

o<(/i;-4)+ J] 2/*+ Y (2/r - (1 - + E 



jeZ:o-{oi)eZ\A i£Z:a{oi)£ A or i£Z:a{oi)eZ\C 



ieD*{o,;) 



(17) 



2) For every q = 0, . . . ,r such that s'g G AU 5o, we consider the move swap(Sq, o'^). We move each facility 
i ^ Zq to (j{oi) if Oi 7^ Og and to otherwise. Note that for every facility i such that o,, cT(oj) G Zg, 
we have a{oi) G ^2, and so Wi ■ c{oi, (j{oi)) < J2jeD*{oi) i(ci + cp- We reassign all clients in D*{o'q) 
to o'g, and reassign each client j in D{s'g) \ (^D*{Og) U D*(o^_^)) to a{a*{j)). The resulting change in 
assignment cost is at most J2jeD*(o' ~ Cj) + J2jeD{s') ^c^- If Sq ^ we reassign all clients in 



D* (o' to S-- _ and bound the resulting change in assignment cost by ^ ) c* + ^ _|_ 

/* ). Therefore, if 5' E A U aSq, we obtain the inequality 

'iq — l " 

< E(/*-/^) + — (4 i + 1) 

(18) 

+ E i(c. + c-)+ E E s*+ E 2c*. 
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3) For every q such that s'^ E B, we again consider swap(s'^, o^). We move facilities on Zq and reassign 
clients in D*{o'q) U (-D(Sg) \ as in case 2). We reassign clients in D*{s'q) n D*{o'^_^) to Oj^ if 

Og = Oj^, and to cr(oi^) otherwise. The assignment-cost change due to this latter reassignment is at most 



1 



- • Ui, + ft,) if o'q = Oi^, and ■ {fi^ + /* ) + EjeD*K,)(ci + S*) otherwise, where to obtain 
the latter inequality we use the fact that n*, < l.5wi^ < n*. since s'^ = Si^ G '^{oi^) G S2 and 
c(oi^, cr(oj^)) < Cj + c* for all j G D*{oi^). We obtain the inequality 

n*, 

o<5^(/*-/.) + ^(4 + /*; 



(19) 

ieZg:a(oi)eZg j&D*{o'g) jeD{s'g) jeD*{o,g) 

j&D*{o,) 



4) For every q such that s'g G 5' \ 5*3, we pick some arbitrary P G Vds'q) and consider swap(start(P), o'^^). 
We analyze this by viewing this as a combination of: (i) a shift along P, (ii) moving i from Oj to s'^, 
where Oj = end(P), and (iii) a shift along the appropriate subpath of Z. The resulting inequality we 
obtain is therefore closely related to ( [TS] ), ([19]). We incur an additional X^ieP ^A* ^crrn for the change in 
movement cost due to (i) and (ii), and Ylj^D{stan{P)) ^'^j ^^^r reassigning clients in L)(start(P)). Also, 
since s'^ is no longer swapped out, we do not incur any terms that correspond to reassigning clients in 
D{s'g). Thus, we obtain the following inequality: 

0<E2// + E(/*-/^)+ E + E E 2c*. (20) 

ieP ieZq i&Zq:cr{oi)&Zq j&D*{o'g) je_D(start(P)) 

jeD*io,) 



We are finally ready to derive (J_5l. An inequality subscripted with an index, like (18,), denotes that 
inequality for that particular index. It will be useful to define 



(21,) = 1(4g Au5o)-(18,) + l(s;Gi?)-(19,) + l(4^53)-(20 



We take the following linear combination. 



2.25 X ([n) + ^2.5 X (21 



part 1 



(22) 



part 2 



The LHS of p2| ) is 0. We analyze the contribution from the /*, fi,c*, Cj terms to the RHS. 
Facilities i ^ Z contribute at most J^ieVdZ) ^ft ■ Consider i G Z. If Oj = end(Z) = 



o'j., we pick up 

4.75(/* — fi) from parts 1 and 2, and we may pick up an additional 2.5 • (/» + //) from part 2 if 
s'^ £ B, i = ir. So overall, we obtain a contribution of at most 8.5/* — fi. Next suppose cr(oj) G Z. If 
a{oi) € Z\A and Sj ^ B, we gather 4.5// from part 1 and 2.5(/* — fi) from part 2, so the total contribution 
is 7f* — 2.5/i. Suppose a{oi) G A. Notice then that Si ^ B, otherwise if Si = s'g, then Zq n T = {i}, 



which contradicts our assumptions. Recall that ai = 1 — -3:^. We gather 2.25(2/* — (1 — ai)fi) from 
part 1, and 2.5[/* - fi + ^(/, + /*)] < 6.25/* - 2.5/i(l - ^) from part 2. Thus, we gather at 
most 10.75/* - (2.5 -^)fi < 10.75/* - fi. If a(oi) e Z\Aand Si G B, then note that actually 
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a{oi) €Z\C.We gather 2.25(/* - /,) from part 1, and 2.5 [/* - fi + -^^^{fi + /*)] from part 2, so 
the total contribution is at most 8.5/* — fi. 

We now bound the {c*, Cj}-contribution. Part of this is EjeD(.;,) 4-5c* + 'EjeD{{S'nS:i)uH{znS)) 5c*- 
We proceed to analyze the remaining contribution. The clients whose remaining contribution is non-zero 
are of two types: (i) clients in D*{Z n O") U D{Z n S3), which are reassigned when a location in O" 
is swapped in or a location in S" is swapped out; and (ii) clients in D*{oi), where Oj G Z n O, which 
are charged when we bound the cost Wi ■ c{oi, (7{oi)) or when Si £ B is swapped out. Consider a client 
j G D*{o'g). Let i = iq. Its (remaining) part-1 contribution is 1.5(cj + c!|) if cr(oj) S A or Sj S -B, and 
otherwise. The part-2 contribution is at most 2.5(c* — Cj) + 2.5c* (this happens when a{oi) G A); so the 
total (remaining) contribution is at most 6.5c* - Cj. For j G D{s'q) \D*{Z r\ O"), where s'^ G Z n ^3, 
we know that j ^ D* (cap(Sq)), and so we have already accounted for its contribution of at most 5c* above. 
Finally, consider j G D*{oi), where Oj ^ O" . Note that cr(oj) G ^2. We gather 1.5(cj + c*) from part 1 
if Sj G -B and otherwise, and ^(cj + cp from part 2 if Sj G .B and |(cj + c*) otherwise; so in total we 
gather at most ^(cj + c*). 



Putting everything together, ( |22l ) leads to inequality ( |T5] l 



Lemma 6.4 Let Z ^V, s'q = start(Z) and o'^ = end(Z). We have 

< 32/; + ^^(92.5/* -fi)+ 42c* + 32c* 

+ E (41c*-c,)+ Y ^(cj+Cj*)- 

jeD*(zno") jeD*{{zno)\o") 



(23) 



Proof : We focus on the case where there exists an index q such that \Zq n T\ = I, s'^ G B, and 
G vl U C as otherwise, ( [T5| ) immediately implies ( [23] ). This case is significantly more involved, in part 



because when s'g £ B, s'^^i G A and o'g = Oi^, we accrue both the term ""^''^"''^ • {fi^ + /* ) in ([19]) when 



s'q is swapped out, and the term ■ {fi^ + /^p in ( [TS] ) when s'^^^^ is swapped out. Hence, there is no way 
of combining ([T7|)-( 20 1 to get a compound inequality having both — /j^ and — J2jeD*(o' ) Cj the RHS. 



In order to achieve this, we define a structure called a block, comprising multiple Zg segments, using which 
we define additional moves that swap in o'^ but swap out neither s'^ nor s^+i, so that the extent to which o'^ 
is swapped in exceeds the extent to which s'^ or s'^^-^ are swapped out. 

We call a set {Sg, Sg+i, • • • , s'^} of consecutive S"-locations a block, denoted by Bqu, if (i) \Zi n = 1 
and s'g G i? for all £ = g + 1, . . . , tt, (ii) s'^^i £ AuC (recall that if s'^_^_-^ is non-existent, then this condition 
is not satisfied), and (iii) \Zg n T\ > 1 or s'^ ^ B. Note that by definition, any two blocks correspond to 
disjoint subpaths of Z. We say that s,s' G S Ci Z are siblings, denoted by s « s', if s,s' G B and they 
belong to a common block; note that this means that neither s nor s' is at the start of a block. We use s ^ s' 
to denote that s and s' are not siblings. Let O" = O' U {o[.}. 

Before defining the additional swap moves for each block, we first reconsider swap(sg, o^) and account 



for the change in cost due to this move differently to come up with a slightly different inequality than ( 17 1 



We again start with ([T6]l, and bound Wi ■ c{oi, (7(oj)) in different ways for an (oj, cr{oi)) edge of Z. We use 

' fi + f*; if o-(oi) ^ A, Si(^ B, or Si ^ a{oi) 

Wi ■ c{oi,a{oi)) < I ai{fi + f*) + EjeD-(o,) i(ci + c*); if o-(oi) G A or a{oi) eC\A, Si e B 
XjeD'K) K^i + '^p5 if ^aioi) ^C, SiG B, and Si 96 a{oi). 
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Incorporating this in ([16]) yields the following inequality. 



o<(/e-4)+ E 2/;+ Yl (2/; - (1 - 



iGZ:Si^a{oi) or 
a{oi)eZ\A,s,'^B 



ieZ:(T{oi)eA or 

o-(oi)eC\A,s,;es 



+ E E E E 2c-. 



(24) 



ieZ:o-{oi)6Z\C 



ie2:o-(oi)eA or jeD*(oi) 



For every block Bqu, we consider the swap moves swap(s£, o^_(_^) for all £ = g + 1, . . . , n — 1. For each 
such £, we obtain the inequality 



^<^{h.+fl)+ui-h.)Hfi,,-h...)+ E E2s*+ E K^^+s*) (25) 



where webound tt^j^ -0(0^, by ^jg^.^g/j |(cj + cp. Also consider swap (s'^, o^^_;^) if s'^ G ^U5'oUi3 
or swap(start(P), o'^+i) if s'^ G S" \ ^s, where P is some arbitrary path in Vc{s'g). Note that if s'^ G then 
7^ o'^- This yields the inequality 

2/;+ (c*-c,) 



(26) 



+ i(4GS'\53)(E2/r+ E 



2c* 



We now derive ([23]) by taking the following linear combination of ( 18 )-(26 1: 



21 X ([24]) + ^ 16 X (21 
part 1 



q:Sq<^ any 
block 



'+ E E7x(2i 



blocks Bn 



+ Y 9x((26 



blocks Bn 



/)+ E (25? 



(27) 



part 2 



part 3 



part 4 



As before, the LHS of ( [27] ) is 0, and we analyze the contribution from the /*, fi, c*,Cj terms to the RHS. 
Many of the terms are similar to those that appear in ( |22| ), so we state these without much elaboration. 

Facilities i ^ Z contribute at most J^ieVdZ) '^'^ft ■ Consider i ^ Z. If Oj = o'^., then note that s'^ 
does not belong to a block, and we gather at most (21 + 16)(/* - fi) + 16 • 1.5(/i + /*) < 61/* - 13/i 
from parts 1 and 2. Suppose a{oi) G Z. If (j{oi) € Z \ A, Si ^ B, we gather 42/* from part 1. Let 
Si G Zq. If s'g does not belong to any block then we pick up 16(/* — fi) from part 2. Otherwise, note that 
s'q must be the start of a block, therefore, we pick up 7(/* — /«) + 9 • 2/* from parts 3 and 4. So the overall 
contribution is at most 67/* — 7/j. Next, suppose Sj o"(oi). Let Sj = s'^. We pick up 42/* from part 1, 

n*, n*, 71*, n*, 

+ ^)ft - (1 - ^)/.] from part 3, and 9[(2 + ^)/* - (2 - ^)/,] from part 4. This 
amounts to at most 91/^* — fi total contribution. Suppose a{oi) G A. We gather 21 (2/* — (1 — ai)/^) from 
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part 1. If Si ^ B, we gather 16[/* - fi + ^(/i + /*)] from parts 2, 3, and 4. If Sj G 5, then Sj = is the 

end of some block, and we gather 16 • ^ (/^ + /* ) + 7 [ (l + ^) /* - (l - ^) f^]+9{f*-fi) from 

parts 2, 3, and 4. So the total contribution in both cases is at most 92.5/* - (5.5 - < 92.5 f* -2.5 fi. 

If (T(oj) € C \ ^ and Si G then Si = s'^ is the end of some block. We gather 21(2/* — (1 — ai)fi) from 

n*, ra*, 

part 1, and 7[(l + -^)f* - (l - ^^)/i] + 9(/* - /i) from parts 3 and 4, which amounts to at most 
74.5/* — 5.5/i total contribution. Finally, if cr(oi) £ Z \ C, Si £ B and Sj 96 fT(oi), we gather 21(/* — /j) 

from part 1. If Sj does not belong to a block, we gather 16[/* — /« H — ^^^^^^(/i + /*)] from part 2. If Sj 
belongs to a block, then notice that it can only be the start s' of the block. So cr(oj) G ^2 and we gather 



7(/* - /j) + 9 • 2/* + 16 • — ^ (/j + /* ) from parts 3 and 4. Accounting for both cases, we gather at most 
70/* - 4/,. 

Now consider the {c*, Cj} -contribution. This includes the terms Ylj&D{{S'nS3)uH{znS))'^'^'^'j ^^'^ 
J2jeD(s'Q) bound the remaining contribution. Consider a client j S D*{o'q). Let i = iq. Its 

(remaining) part-1 contribution is 9(cj + Cj) if a{oi) G yl or Si £ B, Si cr{oi), and otherwise. If s'^ is 
not in any block, we pick up at most 16(c* — Cj) + 16c* from part 2. If s'^ is the start of a block, we pick 
up 7(c* — Cj) from part 3, and note that cj(oj) ^ A, s-i ^ i?. If s'^ is an intermediate S"-location of a block, 
then Si = s'q and Sj ^ cr(oj). We pick up 7{c*- — cj) from part 3, and 9(|c* + |cj) from part 4. So the total 
(remaining) contribution is at most 41c* — Cj in all cases. Now consider j E D*{oi), where Oi ^ O" . We 
have a{oi) E 5*2. Let Oj E Z^. We accrue 14(cj + c*) from part 1 if Sj E i? and otherwise. If s'^ does not 
belong to a block, we gather at most ^ [cj + c*) from part 2; if s'^ belongs to a block, it must be the start of 
the block, and we gather at most ^(cj + c*) from part 3. So in total, we accrue at most ^{cj + c*). 



Thus, (27 1 leads to inequality 



Lemma 6.5 Let Z £ C and r = |5"| > 0. Define s'^ = s'^ o'^ = o'^ ^^^^ ^.for all q. Then, 

0< 32/*+ 5^(92.5/* -/i)+ 32c* 

ieVc{z) idZ jeD{{S'nS3)uH(zns)) 

122 

i6D*(znO") ieD*({zno)\0") 



Proof : The arguments are almost identical to those in the proofs of Lemmas [6. 3 1 and 6.4 The only change 



is that we no longer have inequality (VJ) or ( |24l ). Instead, we start with the inequality < J2ip zi~fi + 



/* + Wi • c{oi, a{oi))), and we bound Wi ■ c{oi,a{oi)) suitably, as in Lemma 6.3 or Lemma 



6.4 to obtain 
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( |29l ) and ( [BO] ) that are analogous to ( [17] ) and (|24]l respectively 



< 



y:+ E (2/* - (1 - + E (/*-/*^ 



ieZ:cr(oi)eZ\yl 



ieZ:(T(oi)eA or 
a{oi)eC\A,Si&B 



ieZ:a{o,)eZ\C 



jeZ:o-(oi)eA jeD*(oi) 

or Si&B 



< 



E 2/;+ (2/* - (1 - a.)/.) + E 



i(^Z:SiKia{oi) or 



ieZ:cr(oi)eA or 

(T(oi)eC\A,sies 



ieZ:cr(o,)ez\c 

Sie-B,Si9i(T(oi) 



ieZ:o-(oi)eA or jeD*(oi) 
Si<^B,Si^a(oi) 



(29) 



(30) 



The rest of the proof proceeds as in Lemmas 6.3 and 6.4 we substitute ( |29l ) for ([17]) in the proof of 
Lemma [63] and substitute ( [30j ) for (24) in the proof of Lemma 6.4 It is not hard to see then that we 



obtain inequality (28 1. 



Theorem 6.6 The cost of a local optimum using 1 -swaps is at most 0(1) times the optimum solution cost. 



Proof : Let F and C denote respectively the total movement- and assignment- cost of the local optimum. 
For a set yl C 5", let center(A) denote |J^gy^{center(s)}. Summing ( [T2[ ) for all 3^82 and simplifying, we 
obtain that 

E E 5c* + 6c*. (31) 

i6D*(center(52)) ieD*(center(52)) jeD{S2) 

Summing ( [23] ) for every path Z ^ V, and ( [T3] ) or ( [28) for every cycle Z G C, yields the following. 



F + 



< 124.5 -F* + Y ^^S* + Y 32c* + Y 



41c* 



jeD*(0\center(S2)) 



jeDiSinSs) 



ieD*(0\center{S'2)) 



+ 



122 , 



(32) 



C,- + 



3 V^J ^ '-jJ- 



je-D*(center(S2)) 



Adding x ([31) to ([32]), we get that 



F + C < 124.5 • F* + ^ 74c* + ^ 32c* + ^ 250c* + ^ 41c* + ^ 249c* 



ieZ)(5o) 

< 124.5 • F* + 499 • C* 



jeDiSinSs) 



jG-D*(0\center(S'2)) jSD* (center(52)) 



7 Bad locality gap with arbitrary facility-movement costs 

In this section, we present an example that shows that if the facility-movement costs and the client-assignment 
costs come from different (unrelated) metrics then the p-swap local-search algorithm has an unbounded lo- 
cality gap; that is, the cost of a local optimum may be arbitrarily large compared to optimal cost. 

We first show a simple example for a single swap case, which we will later generalize for p swaps. 
Suppose we have two clients jo,ji and two facilities io,n. Some distances between these clients and 
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facilities are shown in the Fig. |2];a); all other distances are obtained by taking the metric completion. Note 
that in this example, in order to have a bounded movement cost for facilities, the only option is to have one 
of io,ji as a final location of facility io and one of ii, jo as a final location of facility ii. 

As can be seen from the figure, the solution O = {io,h} has total cost 2 (the movement cost is and 
the client-assignment cost is 2). Now consider the solution S = {jo,ji} which has a total cost of 2D (the 
movement cost is 2D and the client-assignment cost is 0). This is a local optimum since if we swap out Jq, 
then we have to swap in ii to have a bounded movement cost, which leads Jq having assignment cost of oo. 
By symmetry, there is no improving move for solution S, and the locality gap is D. 



{oo,D) 
n I ■ 1 30 






(l,oo) 




jo 


io 






(l,oo) 



(l,oo) 



Jp 



(l,oo) 



(oo,_D) 



(a) (b) 

Figure 2: Examples showing large locality gap for the cases where local search allows (a) single swaps (b) 
at most p simultaneous swaps. The label (a, b) of an edge gives client-assignment cost a and the movement 
cost 6 of a facility along that edge. 

Now consider the example shown in Fig.|2];b) for local-search with p simultaneous swaps. Suppose we 
have facility set {io, , • • • , ip} and client set {jo,ji , ■ ■ ■ , jp}- The global optimum O = {io,ii, ■ ■ ■ , ip} has 
total cost p+1 (facility movement cost is and client-assignment cost is (p+l)-l) while S = {jo, ji, ■ ■ ■ , Jp} 
is a local optimum whose total cost h{p+l)-D (facility movement cost is (p + 1) • and client-assignment 
cost is 0). Consider any move swap{X, Y). Note that j^ G X iff i^^i G Y (where indices are mod (p+l)) 
to ensure bounded movement cost. Let k be such that jk G X and j^+i ^ X. Then, jk has an assignment 
cost of cxD in the solution (5 \ X) U Y. Hence, is a local optimum. 
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